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Abstract — We extend quantum rate distortion theory by con- 
sidering auxiliary resources that might be available to a sender 
and receiver performing lossy quantum data compression. The 
first setting we consider is that of quantum rate distortion coding 
with the help of a classical side channel. Our result here is 
that the regularized entanglement of formation characterizes 
the quantum rate distortion function, extending earlier work 
of Devetak and Berger. We also combine this bound with the 
entanglement-assisted bound from our prior work to obtain the 
best known bounds on the quantum rate distortion function for 
an isotropic qubit source. The second setting we consider is that 
of quantum rate distortion coding with quantum side information 
(QSI) available to the receiver. In order to prove results in this 
setting, we first state and prove a quantum reverse Shannon 
theorem with QSI (for tensor-power states), which extends the 
known tensor-power quantum reverse Shannon theorem. The 
achievability part of this theorem relies on the quantum state 
redistribution protocol, while the converse relies on the fact that 
the protocol can cause only a negligible disturbance to the joint 
state of the reference and the receiver's QSI. This quantum 
reverse Shannon theorem with QSI naturally leads to quantum 
rate-distortion theorems with QSI, with or without entanglement 
assistance. 

Index Terms — quantum rate distortion, quantum side infor- 
mation, entanglement of purification, isotropic qubit source, 
quantum reverse Shannon theorem 



I. Introduction 

Schumacher proved that the optimal rate of data compres- 
sion of a memoryless, quantum information source is given 
by its von Neumann entropy [1 1. This data compression limit 
was evaluated under the requirement that the data compression 
scheme is lossless, in the sense that the information emitted by 
the source is recovered with arbitrary precision in the limit of 
asymptotically many copies of the source. However, the lack 
of sufficient storage could make it necessary to compress a 
source beyond its von Neumann entropy. By the converse of 
Schumacher's theorem, this would mean that the information 
recovered after the compression-decompression scheme would 
suffer a certain amount of distortion compared to the original 
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information. In other words, the data compression scheme 
would be lossy. 

The theory of lossy quantum data compression is called 
quantum rate distortion theory, in analogy with its classical 
counterpart developed by Shannon [2|. It deals with the trade- 
off between the rate of compression and the allowed distortion. 
The trade-off is characterized by a rate distortion function 
which is defined as the minimum rate of data compression for 
a given distortion, with respect to a suitably defined distortion 
measure. 

In the first paper to discuss quantum rate distortion theory, 
Bamum considered a symbol-wise entanglement fidelity as 
a distortion measure Q. With respect to it, he obtained a 
lower bound on the quantum rate distortion function in terms 
of an entropic quantity, namely, the coherent information 0]. 
Even though this was the first result in quantum rate distortion 
theory, it is unsatisfactory since the coherent information can 
become negative, whereas the rate distortion function, by its 
very definition, is always non-negative. 

In [5|, we obtained an expression for the quantum rate 
distortion function in terms of the entanglement of purification 
||6l . which, in contrast to the coherent information, is always 
non-negative. However, our result too is not entirely satisfac- 
tory since the expression is given in terms of a regularized for- 
mula and hence cannot be effectively computed. Furthermore, 
there is recent evidence that the entanglement of purification is 
a non-additive quantity \7\, which if true would lead to further 
complications in evaluating the expression. The search for a 
single-letter formula for the quantum rate distortion function 
hence remains an important open problem. 

It is often convenient to consider data compression in 
the communication paradigm, in which a sender (say, Alice) 
compresses the information emitted by the quantum informa- 
tion source and sends it to a receiver (say, Bob) who then 
decompresses it. In this setting, one considers Alice and Bob 
to have additional, auxiliary resources which they can employ 
to assist them in their compression-decompression task. One 
such auxiliary resource is prior shared entanglement between 
Alice and Bob. In |5|, we proved that in its presence, the 
quantum rate distortion function is characterized by a single- 
letter expression in terms of the quantum mutual information. 
This expression obviously provides a single-letter lower bound 
on the unassisted quantum rate distortion function, since, for 
any given distortion, the extra resource could in principle 
allow for improved compression. Furthermore, this result 
demonstrates that the coherent information (at least in the 
form suggested by Barnum [3]) is irrelevant for the task 
of unassisted quantum rate distortion coding because half 
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the quantum mutual information is a lower bound on the 
unassisted quantum rate distortion function and it is also an 
upper bound on the coherent information. 

II. Summary of Results 

In this paper, we consider rate-distortion coding in the 
presence of other auxiliary resources, e.g., access to a noiseless 
classical side channel or quantum side information. Doing so 
not only provides new bounds on the unassisted quantum rate 
distortion function, but it also offers new scenarios that are 
unique to the quantum setting. 

Alice and Bob are said to have a noiseless, forward classical 
side channel if Alice is allowed unlimited classical communi- 
cation to Bob. Quantum rate distortion in the presence of such 
an auxiliary resource was studied for the special case of an 
isotropic qubit source - i.e., one that produces a maximally 
mixed state on a qubit system - by Devetak and Berger (8). 
We consider the general case of an arbitrary, memoryless 
quantum information source, and prove that the corresponding 
rate distortion function is given in terms of a regularized 
entanglement of formation |9). This classically-assisted rate 
distortion function serves as an alternate lower bound to the 
unassisted quantum rate distortion function, and we show that 
it can be tighter than the above entanglement-assisted lower 
bound in some cases. 

Quantum rate distortion coding in the presence of quantum 
side information (QSI) corresponds to the following setting: 
Suppose a third party (say, Charlie) maps the source state p 
via some isometry to a bipartite state pab, and distributes the 
systems A and B to Alice and Bob, respectively. The goal 
is for Alice to transfer her system A to Bob, up to some 
given distortion, using as little quantum communication as 
possible. The rate distortion function is then defined as the 
minimum rate of quantum communication required for this 
task, evaluated in the limit in which Alice and Bob share 
asymptotically many copies of the state pab- We obtain an ex- 
pression for the rate distortion function under the assumption 
that the protocol causes asymptotically negligible disturbance 
to the joint state of Bob and a reference system R that purifies 
the state pab- This assumption may be motivated naturally in 
light of the fact that Bob may want to reuse his quantum 
side information in some future information processing task. 
Furthermore, if we allow Alice and Bob to have sufficient 
prior shared entanglement in addition to Bob's QSI, then, 
under the above assumption, the rate distortion function is 
given by a single-letter expression in terms of the quantum 
conditional mutual information. A classical analogue of the 
above problem was solved by Wyner and Ziv ifTUl . Our results 
on quantum rate distortion with QSI also generalize Luo and 
Devetak's results on classical rate distortion in the presence of 
QSI [11] and our prior results in |[T2l on quantum-to-classical 
rate distortion coding with QSI. 

The techniques used to prove these results are generaliza- 
tions of those which we employed in Q to obtain expres- 
sions for the unassisted and entanglement-assisted quantum 
rate distortion functions. The main ingredient in the proof 
of the achievability part for the entanglement-assisted case 



in is the quantum reverse Shannon theorem lfT3l . [14|, 
which quantifies the minimum rate of quantum communication 
required from Alice to Bob in order to asymptotically simulate 
a memoryless quantum channel, when they share entangle- 
ment. (It has been pointed out in both ifTSI and lfl6l how a 
reverse Shannon theorem immediately leads to a rate distortion 
protocol.) Analogously, to establish the achievability of our 
rate distortion functions in the presence of QSI, we employ 
a generalization of the quantum reverse Shannon theorem to 
the case in which Bob has QSI as an auxiliary resource. This 
theorem constitutes a result which is interesting in its own 
right, and the protocol of quantum state redistribution [17|, 
ifTSl plays a key role in the proof of the achievability part. The 
achievability of the rate distortion function in the unassisted 
case was proved by using Schumacher compression [5], which 
can be viewed as a special reverse Shannon theorem where 
the goal is to simulate the identity channel. Analogously, the 
achievability of our rate distortion function in the presence of 
a classical side channel is proved by exploiting a variant of 
Schumacher compression. 

The converse proofs of the results in this paper have certain 
similarities, using various identities and entropic inequalities, 
e.g., the quantum-data processing inequality J4], superaddi- 
tivity of the quantum mutual information, and the Alicki- 
Fannes inequality [19]. However, a non-trivial aspect of the 
QSI converse proofs is that they exploit the assumption that the 
protocol causes only a negligible disturbance to the joint state 
of the reference system and Bob's QSI. We can then invoke 
Uhlmann's theorem GUI as in Refs. J2TJ, lfT2l to demonstrate 
the existence of a map which does not act at all on Bob's 
quantum side information, such that the overall map on the 
source state is close to the original map that acts in part on 
Bob's quantum side information. As such, this feature of the 
converse proof is unique to quantum information theory and 
simply is not present in related classical results fTOl . 
The paper is organized as follows. 
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In Section 

introduce necessary notation and definitions, especially for the 
entropic quantities arising in the statements of the theorems. In 



Section IV we introduce the basic concepts of quantum rate 



distortion theory, and unify our results from [5| on unassisted 
and entanglement-assisted quantum rate distortion functions in 
order to obtain a trade-off between the rate of compression and 
the rate of entanglement consumption. We analyse quantum 
rate distortion in the presence of a classical side channel 
in Section [V] and in the presence of QSI in Section VI 



The latter section also contains our theorem on the rate 
distortion function in the presence of both QSI and prior shared 
entanglement. The necessary generalizations of the quantum 
reverse Shannon theorem required for our proofs are stated 



and proved in Section VI-B We conclude in Section VII with 



a summary and some open questions. 

III. Notations and Definitions 

Let B(H) denote the algebra of linear operators acting on 
a finite-dimensional Hilbert space H, and let T>(H) denote 
the set of positive operators of unit trace (density operators) 
acting on H. For any given pure state \if>) £ H we denote the 
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projector simply as ift. The trace distance between two 

operators M and N is given by 

\\M-N\\i=Tr(\M-N\), 

where \C\ = VcUJ. Throughout this paper we restrict our 
considerations to finite-dimensional Hilbert spaces, and we 
take the logarithm to base two. 

We denote the Hilbert space associated to a quantum 
system A by Ha> and the quantum systems corresponding 
to n copies of a pure state tp ABC by A n , B n and C n . For a 
multiparty state pas, we unambiguously refer to its reduced 
states on systems A and B simply as p A and p B , respectively, 
so that it is implicit that pa = Ttb(pab) and Pb = Tta(pab)- 
Moreover, we denote a completely positive trace-preserving 
(CPTP) map TV : B{H A ) -> B(H B ) simply as Na^b- 
Similarly we denote an isometry U : B(Ha) —> B(W B ®T~Le) 
simply as Ua-^be- The identity map on states in V(Ha) is 
denoted as id,4- 

The von Neumann entropy of a state p £ V(Ha) is defined 

as 

H{p) = H{A) p = -Tr{p\ogp), 

and satisfies the bound H(p) < logdim(7^). The conditional 
entropy H(A\B) p , the quantum mutual information I(A; B) p , 
and the conditional quantum mutual information I (A; B\C) P 
of a tripartite state pabc are defined as follows: 

H{A\B) p = H{AB) P -H{B) P , 
I(A;B) p = H(A) p -H(A\B) p , 
I{A;B\C) P = H{A\C) p -H{A\BC) p . (1) 

The entanglement of formation [9| of a bipartite state pab is 
defined as 

E f (pab) = f , min S^p{x)H{A)^, (2) 

where {p(x), \ip AB )} ' s an ensem ble of pure states such that 
Pab = J2 X P( X ) Wab){^ab\' The entanglement of purifica- 
tion [6] of a bipartite state pab is given by the following 
expression: 

E p (p AB ) = miii#((id B <Z>Me^e>){?bb{p)))> (3) 

where <tbe{p) = Tr^(0^ ss ), <j) ABE is some purification of 
Pab, an d the minimization is over all CPTP maps Ne^,e' 
acting on the system E. 

We also employ the following lemmas in our proofs. 

Lemma 1 (Quantum data processing inequality /EyJ." If 
uj ab , = (id A (g) Nb^b>){o- A b), where Af B ^B' is a CPTP 
map, then 

I(A;B) a >I(A;B%. (4) 

Lemma 2 (Alicki-Fannes Inequality H19V ): Suppose two 
states pab and gab are close in trace distance: 

Wpab — o~ab\\ i < £) for some e > 0. 
Then their respective conditional entropies are close: 

\H(A\B) P - H(A\B) a \ < Ae\og\A\ + 2h 2 (e), (5) 



where 112(e) = — eloge — (1 — e)log(l — e) is the binary 
entropy. 

In our converse proofs, we often deal with information 
quantities evaluated on n systems. For simplicity, rather than 
listing the precise bound from the Alicki-Fannes inequality, 
we often just state an upper bound as ne', where it is implicit 
that e' = cielogd + C2 h2(e)/n for positive constants c\ and 
C2, and d being the dimension of a single system. 

Note that it is easy to derive continuity inequalities for 
the quantum mutual information and the conditional quantum 
mutual information from the Alicki-Fannes inequality simply 
by employing the triangle inequality. 

Lemma 3 (Superadditivity of q. mutual information S5]l): 
The quantum mutual information is superadditive in the sense 
that, for any CPTP map Na 1 a 2 ^b 1 b 2 , 

I(RiR 2 ; BiB 2 ) a > I{Ri]Bi) a + I(R 2 ;B 2 ) a , 

where 

vr 1 r 2 b 1 b 2 = Na 1 a 2 ^b 1 b 2 (4>b. 1 a 1 ®¥R 2 A 2 ), 

and 4>r 1 a 1 and <pr 2 a 2 are pure bipartite states. 

We make frequent use of the following lemma in our 
converse proofs. For completeness, we give its proof. 

Lemma 4 (Duality of Conditional Quantum Entropy): For 
a tripartite pure state ipABC, 

H(A\B) i , = -H(A\C)^. 

Proof: 

H(A\B)^ = H(AB) i , - H(B) 4 , 
= H(C)^ - H(AC)^ 
= -H{A\C) i> . 



IV. Quantum Rate Distortion 

Throughout this paper we consider a memoryless quan- 
tum information source characterized by a density matrix 
p G T>(Ha)- We refer to p as the source state and denote 
a purification of it by i/j ra = \4> P ra) (^ra^ w ^ n ^ being 
a purifying reference system isomorphic to A. The state 
p n = p® n G V('H A )n ) characterizes n copies of the source. 
A source coding (or compression-decompression) scheme of 
rate R is defined by a block codeQ which consists of two 
quantum operations — the encoding and decoding maps. The 
encoding £ n is a CPTP map from n copies of the source 
space to a Hilbert space Hq™ of dimension m 2 nR : 

and the decoding T> n is a CPTP map from the compressed 
space to the original Hilbert space "H®": 

V n :V(H Qn )^V(HT)- 

'it should be very clear from the context whether R refers to "rate" or 
"reference system." 
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The average distortion resulting from this compression- 
decompression scheme is defined as an average Q, 0: 

d{p,V n o£ n ) = -Y^d{p,F^), 



i=l 



where J 7 ,^ is the "marginal operation" on the z-th copy of the 
source space induced by the overall operation T n = F> n ° £ n , 
and is defined as 

= ^A 1 Ai...A 4 _ I >i 4+1 ...^[j>.(p® (i - 1) ®e®p 0(n - i) )] J 

(6) 

and for any CPTP map Af, 

d(p,Af) = l-F e (p,Af), 
with F e being the entanglement fidelity of Af: 

p e {p, Af) = {r RA \^ R ®Af A ^ A ){r RA w RA ). (7) 

The quantum operations T> n and £ n define an (n, R) quantum 
rate distortion code. 

Motivated by the observation that d is the average of a 
linear function of the marginal channels, and by Shannon's 
rate distortion theory |2|, which allows a general average-type 
distortion function of input and output, we can generalize the 
above setting as follows |22), J23), El: Let A > be an 
observable on AB, which we will (without loss of generality) 
assume to be non-negative. This distortion observable A will 
define the distortion function between input (A) and output (B) 
of the compression and decompression maps £ n : T>(T-L^ n ) —> 
V(U Qn ) and V„ : V(U Qn ) -> V(H% n ), respectively, with 
A not necessarily equal to B. With this, we can define the 
distortion of a channel Af with respect to a source p, as 

d(p,Af) = TT(A RB ((id R ®Af A ^ B )(^ p RA ))), 
and the average distortion on a block of n as 
1 ™ 

d( P ,v n o£ n ) = -y^dip,^) 

n * — ' 

i=l 

= Tr(A((id«^)((^ A r i ))). 

(i) 

Here, the marginal channels F„ are now given by 

= Tr SlB2 ... Bi _ lBi+1 ... B „[J-„^( l - 1 )®^P^ ( "^ ) )], 

and 



A.i^ 



i=l 

is the average distortion observable. 

Allowing different input and output spaces, and comparing 
them via the otherwise completely arbitrary observable A may 
look like a drastic departure from the source coding paradigm, 
but looking at Shannon's rate distortion theory Q and its 
applications reveals that it is indeed very natural - see also 
the examples below. 

For any R, D > 0, the pair (R, D) is said to be an 
achievable rate distortion pair if there exists a sequence of 
(n,R) quantum rate distortion codes (£ n ,T> n ) such that 



The quantum rate distortion function is then defined as 

R q {D) = mf{R : (R, D) is achievable}. 

In 0, we proved that the quantum rate distortion function ad- 
mits the following characterization in terms of the regularized 
entanglement of purification: 



R q (D) = lim 

k ' 



1 

yo k 



jnin 

d(p,M (k) )<D 



E 



' p (p® k ,AfW) 



(9) 

In the presence of an auxiliary resource, the rate distortion 
function is defined analogously, the only difference being 
in the encoding and decoding maps. In particular, if Alice 
and Bob have prior shared entanglement, then, denoting the 
entangled systems by T A and Tg (with T A being with Alice 
and Tg being with Bob), the encoding and decoding maps are 
respectively given by 



and 



V n :V('Hq~®'Ht b ) 



V{H% n ). 



(10) 



(11) 



We denote the corresponding entanglement-assisted quantum 
rate distortion function, for a given distortion D > and 
unlimited amount of entanglement, as i?| a (D). In Q, we 
proved that the entanglement-assisted quantum rate distortion 
function is equal to the following single-letter expression: 



Ria(D) 



1 



min I(R- B) u 

JV : d(p,M)<D 



lim d(p,V n o £ n ) < D. 



(8) 



where uj rb = Af A ^g{%l) p RA ). 

Remark 5: It should be noted that for every choice of distor- 
tion observable A, the quantum rate distortion function R q (D) 
is convex in the distortion D, and likewise the entanglement- 
assisted rate distortion function R q a (D). This is seen by 
observing that time-sharing codes of rate Rj and distortion Dj 
(j = 1, 2) on fractions A and 1 — A of a block, yields directly a 
code of rate XRi + (1 — A)i?2 and distortion XDi + (1 — A)Z?2- 

By the above observation and the coding theorem expressed 
by (|9]), we can conclude that the regularized expression on 
the RHS of (|9]l is convex in D. This is true, even though 
the convexity of the expression on the RHS of (|9]l is not 
immediately evident, since it is known that the entanglement 
of purification and even its regularization are not convex in 
the state on which it is being evaluated [6|. In fact, all the 
coding theorems in this paper contain expressions for the 
rate distortion function (or for rate regions) that are convex 
in the distortion parameter D. Indeed, one well-known way 
of proving convexity of an expression for a rate-distortion 
function is as outlined in Lemma 14 of |5|, and this approach 
relies on convexity of the underlying information measure 
with respect to a distortion channel. Thus, for any finite 
k, the expression in ^ is not convex in D because the 
entanglement of purification is not convex, and it is only in the 
regularized limit that this expression is convex in D. For the 
curious reader, we show in Appendix [A] that the mathematical 
expression on the RHS of |9]l is convex in D. 
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Example 6: The original distortion measure based on en- 
tanglement fidelity is recovered in the case where A = B, by 
letting the distortion observable A = 1 — V> p - 

Example 7: Given a classical distortion function d : X x 
y — » M>o, as considered in [2|, for input and output alphabets 
X and y, we consider Ha = and Hb — C^, and let 

A = 53d(a:,y)|a!)<a;|(»|y)<tf|. 

In classical rate distortion theory we also consider an IID 
source with single-letter marginal probability distribution 
P(x), giving rise to the diagonal source density p = 
J2x P ( x )\ x )( x \ and its purification \ip p ) = V P ( X )\ X )\ X )- 

Now, a classical source coding scheme of rate R and 
distortion D naturally turns into a source code in the above 
quantum sense, by lifting the stochastic encoding and decoding 
maps to CPTP maps sending diagonal matrices to diagonal 
matrices; furthermore the quantum version still has rate R 
(now qubits) and the same distortion D. 

Conversely, given a source code in our above sense, the 
relation to classical Shannon-style rate distortion coding is a 
little more subtle. To start, however, we can at least say that 
without loss of generality the overall map T n is a classical 
channel from X n to y n , because we can dephase the input to 
£ n in the x-basis, and the output of T> n in the y-basis, without 
affecting rate or distortion. The compressed system Q n may 
of course still use quantum states in a nontrivial way, for 
instance if unlimited entanglement is available, to "superdense- 
code" lf24ll the classical compressed information of a classical 
rate distortion code, thus halving the bit rate to the qubit 
rate. On the other hand, Theorem 3 of [5| shows that this is 
the only improvement; indeed, the entanglement-assisted rate 
distortion function is exactly half the classical one R{D) in 
[2|. From this we can deduce that the unassisted quantum rate 
distortion function equals the classical one. If we assume the 
opposite, namely that the former were strictly smaller than 
the latter, then by invoking remote state preparation of the 
compressed quantum states at an asymptotic cbit/qubit rate of 
one [25 1, and then "superdense-coding" the classical bits, we 
would get an entanglement-assisted rate distortion code of the 
same distortion but rate \R q (D) < ^R(D), resulting in a 
contradiction. 

Example 8: Quantum-to-classical rate distortion lfT2l is 
based on an observable of the form 

y 

In IP721 a source code for it was defined as a measurement 
(i.e., a qc-channel) taking values in a subset of y n , and as 
in Example [jj these codes can be understood as quantum rate 
distortion codes in the above sense. We add as an aside that 
in the limit of zero distortion, this model can be traced back 
to the work of Massar and Popescu ||26l and of Massar with 
one of us l27l . 

However, given a quantum rate distortion code in our 
present sense, by the same logic as in Example [TJ the overall 
map T n is without loss of generality a qc-channel, and the 
coding part in Theorem 3 of [12 1 achieves the same classical 



communication rates as Theorem 2 in - with the difference 
that only shared randomness rather than entanglement is 
required, which then can be removed entirely. With quantum 
communication assisted by entanglement, we hence get half 
that rate, R% a {D) = \R qc {D). 

Conversely, assuming that the (unassisted) quantum rate 
distortion function were strictly smaller than R qc (D), leads 
to a contradiction along the same lines as in Example |7j we 
could use entanglement to replace the compressed qubits by 
cbits at asymptotic exchange rate 1, and by superdense coding 
would obtain an entanglement-assisted rate distortion code (of 
the same distortion) of rate < ^R qc (D), contradicting our 
conclusion in the previous paragraph. 

In summary, the theory of quantum-to-classical rate distor- 
tion coding is subsumed in the above general framework. 

As reviewed above, in we obtained expressions for the 
quantum rate distortion function R q (D) and the entanglement- 
assisted quantum rate distortion function R q a (D) in terms 
of entropic quantities (note that going to general distortion 
observables does not change the form of these results). By 
unifying these results, we obtain a rate region characterizing 
the quantum communication and entanglement consumption 
that is necessary and sufficient for lossy compression of an 
IID quantum source. This is given by Theorem [9] below. 

Theorem 9: For a memoryless quantum information source 
defined by the density matrix p E T)(T-La) with a purification 
\iPjia), and any given distortion D > 0, the quantum rate 
distortion coding region for lossy source coding with noiseless 
quantum communication, with the help of rate-limited shared 
entanglement at rate E, is given by the union of the following 
regions, letting k become arbitrarily large: 

Q> ~I(R k ;B k E B ) u , 

Q + E > jH(B k E B )uj, (12) 
k 

where the entropic quantities are with respect to the following 
state: 

ur»b>e a e b = V E ^ EAEB {U^ BkEk {{r RA r k )), (B) 

and the union is over all isometric extensions U J ^ { ' of 
CPTP maps A/" (fe) such that d(p,Af^) < D and isometries 

V E k -+E A E B - 

Proof: Our proof of these bounds requires just a slight 
modification of the proofs of the converse theorems in Ref. (5) . 
Indeed, consider the most general protocol for rate-limited 
entanglement-assisted quantum rate distortion coding. The 
protocol begins with the reference and Alice sharing the state 
(^ra)®™- L et R n denote the reference's systems, and let A n 
denote Alice's systems. Alice and Bob share entanglement in 
the systems Ta and Tg before communication begins, and we 
suppose that the logarithm of the dimension of system Tb is 
no larger than nE. Alice acts with an encoder (some general 
CPTP map) on her systems A n , obtaining a system W. She 
then sends W to Bob, who subsequently feeds W and Tg into 
a decoder to produce the system B n . By Stinespring's dilation 
theorem [28], [29 1, we can simulate this protocol by one in 
which Alice's encoder is replaced by an isometric extension 
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of it, with outputs W and an environment E\, and Bob's 
decoder is replaced by an isometric extension of this decoder, 
with outputs B n and an environment E 2 . At the end of the 
simulation, the state on systems R n B n E±E-2 is a state of the 
form in ( fT3] l. 

We can now obtain a lower bound on the rate Q of quantum 
communication as follows: 



and 



nQ = log(diniH 



w 



> H(W) 

= H(WT B ) 

> H(B n E 2 ) 



H{T B \W) 
- nE. 



The first equality is the entropy chain rule. The second inequal- 
ity follows because H(WTb) = H(B n E 2 ) (considering an 
isometric extension of the decoder) and because conditioning 
cannot increase entropy: H(T B \W) < H(T B ) < nE. 
The other bound results as follows: 

2nQ > 2H(W) 

= I(W;R n T B E 1 ) 

> I(W;R n T B ) 

= I(WT B -R n ) + I(W] T B ) - I{R n ;T B ) 

> I(WT B ;R n ) 
= I(B n E 2 ;R n ). 

The first equality follows from the fact that H(W) = 
H(R n T B Ei) and H(WR n T B E 1 ) = for a pure state on sys- 
tems W R n T B E\. The second inequality results from applying 
the quantum data processing inequality. The second equality is 
an identity. The third inequality follows because systems R n 
and T B are in a product state (implying I(R n ; T B ) =0) and 
from the fact that I(W;T B ) > 0. The final equality results 
because entropy is invariant under isometries (in this case, the 
isometric extension of the decoder taking systems WT B to 
B n E 2 . This proves the converse part of this theorem. 

The achievability part of this theorem follows simply by 
picking a map that meets the distortion constraint and applying 
Theorem 3b of iPOl . ■ 

It is worth remarking that in the case there is sufficient 
entanglement available, the above theorem reduces to the 
entanglement-assisted quantum rate distortion function from 
Theorem 3 of Q, while if there is no entanglement avail- 
able, then the above theorem reduces to the entanglement of 
purification characterization from Theorem 5 of [5]. 

V. Classically-Assisted Quantum Rate Distortion 

In this section, we consider quantum rate distortion coding 
in the presence of classical side information. As mentioned 
in the Introduction, this corresponds to the scenario in which 
Alice is allowed unlimited, forward classical communication 
to Bob to assist them in their compression-decompression 
task. We refer to the corresponding rate distortion function 
as the classically-assisted quantum rate distortion function 
and denote it by R?_ ¥ (D) for a given distortion D > 0. It 



is defined analogously to R q (D) (see Section IV I, except that 



the encoding and decoding maps are now given by 



where Hx denotes the Hilbert space associated with the 
classical information that Alice sends to Bob. Like the previous 
rate distortion functions, also R'L t (D) is convex for any given 
distortion observable A. 

We prove the following theorem, which gives an expression 
for RS^ (D) in terms of the entanglement of formation defined 
in 0. 

Theorem 10: For a memoryless quantum information 
source defined by the density matrix p 6 T>('Ha)> an d an y 
given distortion D > 0, the quantum rate distortion function 
assisted by unlimited, forward classical communication is 
given by 

i^(U)=lim r min \E P {p 9k , Af W )l , (14) 



hm — mm 

d(p,J^( k ) )<D 



where M {k) : V(hf k ) -> V(U% k ) is a CPTP map, and 

E F (p,M) =E F (w RB ) (15) 
denotes the entanglement of formation of the state 



uj rb = (id R <g> JV A ^ B )(ip p RA ). 



(16) 



f 



v(nT) 



V(H Qn ®H x ), 



Proof: The achievability part of the above theorem was 
essentially proven by Devetak and Berger [8 1 for the particular 
case of a source of isotropic qubits, even though they did not 
express their result explicitly in the form of the entanglement 
of formation. Moreover, they did not give a general converse 
proof. For the sake of completeness, we include a proof of 
achievability in addition to giving a proof of the converse. 

To prove the achievability part, our approach is the same as 
that of Devetak and Berger (8), namely, to exploit a variant of 
Schumacher compression with classical communication JT|- 
To start with, consider k = 1 on the RHS of ( p"4| ), and fix 
the CPTP map M = j\f^ such that the minimization on the 
RHS of this equation is achieved. Every Kraus decomposition 
of this map Af A ^B as where J2 x A l A * = !> 

leads to a pure-state decomposition of the state 0Jit B : 

Mrs — Y,{lR®A x )(r RA )(I R ®Al). 

x 

Note that each operator (I R ® A x )(ip RA )(I R eg) Aj.) is of rank 
one, so that each normalized version is a pure state. Consider 
the following extension of the above state: 

^RBX — Y,(Ir ® A x )(r RA )(I R ® At) ® \x)(x\ x , (17) 

x 

where X denotes a classical register. Note that the above state 
can be considered to result from the action of a quantum 
instrument on ip RA since it has both a quantum and a classical 
part. 

The entanglement of formation of the state oj rb is then 
equal to 

Ef(urb) = mm H(B\X) U , (18) 

{-4*} 

where the minimization is over the choice of Kraus operators. 
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The protocol proceeds as follows. To start with, the refer- 
ence and Alice share n copies of ip RA , which is a purification 
of the source state p G V(H A ). Alice determines the Kraus de- 
composition of the CPTP map Af (chosen as described above) 
that minimizes the conditional entropy H(B\X) U> , Henceforth, 
we denote the corresponding set of Kraus operators simply 
as {Aa;}. On every copy of the source state, she performs 



the quantum instrument given by ( 17 1. She then measures the 



classical register X of each copy of the resulting state ujrbx, 
thus obtaining a classical sequence x n = (xi, x-i, ■ ■ ■ , x n ), 
where xi denotes the outcome of measuring the X register 



of the i m copy of oj rrx . From illl it is clear that the 



probability that Alice gets an outcome x upon measuring an 
X register is given by px(%) = Tr(^4J,A x p). In the large n 
limit, with high probability, there are approximately npx(x) 
states in the length n sequence such that the outcome of the 
measurement is x (i.e., the sequence x n will be strongly typical 
[29 1 with very high probability). If the sequence Alice obtains 
is not strongly typical, she aborts the protocol. Otherwise, 
she groups together the states in the length n sequence with 
the same measurement outcome and performs Schumacher 
compression on each of these blocks, compressing each block 
to approximately npx(x)H(B) Px qubits, where 



1 



Px 



Px(x) 



A xP Al e V(H B ), 



and H(B) Px = H(p x ). She then sends these qubits to 
Bob, along with the classical sequence x n representing her 
measurement outcomes. 

Thus, the total rate at which she sends qubits to Bob is 
given by (fT~8]> because 



Y,Px{x)H{B) Px = H(B\X) 



Conditional on the sequence x n that he receives, Bob decom- 
presses the qubits in each block (according to Schumacher 
decompression) and finally discards the classical sequence. 
The result of this protocol is that, for n large enough, a state 
very close to uj R g is shared between the reference and Bob. 

One could then execute the above protocol by blocking k 
of the states together and by having the CPTP map to be of 
the form Af {k) : V{U Ak ) -> V{U B u), (where U a h = 
and %B k — T~L% k ) acting on each block of k states. By letting 



k become large, such a protocol leads to the rate in ( 14 1 for 
classically-assisted quantum rate distortion coding. 

The converse part of the theorem is proved as follows. The 
most general protocol begins with many copies of the state 
ip RA being shared between the reference and Alice. The most 
general map that Alice can perform is a quantum instrument 
from A n to a quantum system W and a classical system M. 
Let this be described by a set of trace non-increasing maps 
{Em}, with J2 m ^ m = ^ She ser, ds the quantum system 
W and the classical message M to Bob. Hence the rate of 
quantum communication is given by Q = (l/n) \og(dimHw)- 
Bob then performs a CPTP map from WM to B n . For him, 
performing a CPTP map on a classical system M and a 
quantum system W is equivalent to performing CPTP maps 
T> m , on the quantum system W, conditional on the value m 



of the classical register M (see, e.g., 1 30 1 or Section 4.4.8 of 
Ref. [29]). Let a RnR n denote the state shared by the reference 
and Bob at the end of the protocol, and let M. A n ^B n denote 
the classically-coordinated encoding-decoding map: 



a R n B n 

Ma^^b^ 



>B~)m A r n ) 



Note that the quantum instrument employed by Alice can 
be simulated by an isometry followed by a von Neumann 
measurement. Specifically, we can consider Alice to perform 
an isometry from A n to quantum systems W, M', and an 
environment E±, and then do a von Neumann measurement 
on M' to get a classical system M. After tracing over 
the environment E\, the original instrument is recovered. 
However, without loss of generality, we could also consider 
Alice to perform a von Neumann measurement of E\, thus 
obtaining a classical system L. Let u> denote the state at this 
point. Observe that the joint state of R n and W is pure, 
when conditioned on the classical systems L and M. Each 
decoding map T> m can be simulated by Bob by performing 
an isometry U m from W to B" and an environment E 2 - Bob 
could subsequently perform a von Neumann measurement on 
Ei, thus obtaining a classical system K. Let a denote the state 
at the end of the protocol. Note that the state on R n B n is pure 
when conditioned on the classical registers MLK. Figure [T] 
depicts both the original general protocol and the simulation 
of it outlined in this paragraph. 

We can now prove a lower bound on the rate of classically- 
assisted lossy quantum data compression as follows: 



nQ 



\og(dimH w ) > H(W) U 

> H{W\LM) U 
= H(R n \LM) u 

> H{R n \LMK) c 

> E F (a RnB ") 



> 



mm 

MM : 

d(p®",Af("))<D 



E F (p® n ,N {n] ) 



The second inequality follows because conditioning on clas- 
sical variables L and M (after Alice's simulation of the 
encoding) cannot increase entropy. The first equality follows 
because (as stated above) the joint state of R n and W is 
pure, when conditioned on the classical systems L and M. 
The third inequality follows again because conditioning on 
the classical register K cannot increase entropy (note that this 
latter entropy is with respect to the state after Bob's simulation 
of the decoder). Now, for the fourth inequality, as we stated 
above, the variables LMK induce a particular pure-state 
decomposition of the state on R n B n , and by the definition of 
entanglement of formation given in d2}, the conditional entropy 
of this particular pure-state decomposition cannot be larger 
than the minimal one given by the entanglement of formation. 
The final bound follows because the map Ylm ^ m ° ^ m 
is a particular CPTP map meeting the distortion constraint 
d(p® n , N^) < D, and thus the entanglement of formation 
for the state resulting from this map cannot be larger than 
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(a) 




(b) 



Fig. 1. (a) A general protocol for quantum rate distortion coding with the assistance of a forward classical side channel from Alice to Bob. (b) A simulation 
of the general protocol in (a), in which Alice and Bob respectively act with isometric extensions of the encoder and decoder in (a). 



the entanglenent of formation of the state resulting from the 
optimal map meeting the distortion constraint. ■ 
We remark that the proof of the achievability part exploits 
a strategy which is similar in spirit to that used in the proof 
of the reverse Shannon theorem (see Refs. (9), E), [32 1, for 
example). In particular, we just pick the map that meets the 
distortion constraint and minimizes the entanglement of for- 
mation and simulate this map using classical communication 
and Schumacher compression. 

A. Bounds on the Quantum Rate Distortion Function for an 
Isotropic Qubit Source 

In this subsection we consider the original case of the 
distortion being based on the entanglement fidelity, i.e., with 
distortion observable A = 1 — ip w , for an isotropic qubit 
source, meaning that the source state is a maximally mixed 
qubit state, tt = 1/2. The following theorem provides an 
exact expression for the entanglement-assisted quantum rate 
distortion function of an isotropic qubit source. 

Theorem 11: For an isotropic qubit source, the 
entanglement-assisted quantum rate distortion function 
is equal to 

(l-lH({l-D,§,%,§}) ifO<£><|, 



R q ea (D) = r * 



if | < D < 1, 



where we have used the notation H({-}) to denote the 
Shannon entropy of the probability distribution inside the 
braces {■}. 

Proof: First, recall the entanglement-assisted rate distor- 
tion function from Theorem 3 of 0: 



R q ea (D) = I mm I{R;BU 

A d(p,N)<D 



(19) 



where the distortion measure d(p,M) is related to the entan- 
glement fidelity: 

d(p,M) = l-F e (p,M). 
The mutual information is with respect to the state 
ujbb = (idfi ®Ma>^b){Vra>)' 



where ip RA , is a purification of the source state p. For an 
isotropic qubit source, we have p = tt = 1/2 and i) RA , = 
( a maximally entangled state). For any channel M, it 
has a Kraus decomposition as follows: 

Af(p) = ^2A xP Al 

X 

We also have the well-known formula for the entanglement 
fidelity (see, e.g., |29|): 

F e (p,N)=J2\MpA x )\ 2 . 

X 

Now suppose that there is some channel M achieving the 



minimum in (19 1, with Kraus operators {A^}. Consider the 
channel Mi defined as follows: 



Afi(p) = alAf(aipafjai 



where ai, i — 0, 1, . . . , 11 are the Clifford unitaries on a 
single qubit (given explicitly, e.g., in Appendix A of J9j). 
Thus, its Kraus operators are \ o\A x Oi > for any fixed i. 
For an isotropic qubit source, the channel Mi has the same 
entanglement fidelity as the original channel because 



F e (TT,Afi] 



^|Tr{™]A^}| 2 = ^J2\ Tl { a i A * a i} 

X X 

= i^|Tr{A :c }| 2 = F e (7r,AA) 

X 

Let Atw denote the "twirled version" of Af: 



(20) 



A similar calculation as in pO) reveals that the "twirled 
version" of the channel Af has an entanglement fidelity equal 
to F e (ir,Af). Also, it is well known that the Clifford twirled 
channel is equal to a depolarizing channel (a probabilistic mix- 
ture of the identity channel and the constant channel mapping 
every input state to the maximally mixed state n = ht, see, 
e.g., J9l, [33), OH). Now, each of the channels Mi leads to 
the same mutual information as the original channel M, in the 
sense that 

I(R;B) u =I(R;B) Ui , 



9 



where 

W = WflB = (id R ® AfA>^B){$RA>), 

^ = (^)rb = (idfl ® (A/;) A '^s)($flA')- 
This is due to the fact that, for a maximally entangled state 
\®RA'), lR®(<Ti)A>\<f>RA>) = (o-f)fl®l^'| $ HA') (where of 
denotes the transpose of of), and because the von Neumann 
entropy is invariant under unitaries. However, we know that 
the twirled channel cannot have a mutual information larger 
than the original channel's, due to the convexity of mutual 
information with respect to the states (uj^rb ■ 
1 



12 ^ 



I(R; B) Ui > I(R;B) 



where 



Wtw = (idi?. <8> (Ntw)A'^B)($RA')- 



This proves that the channel optimizing the expression in ( 19 1 
for an isotropic qubit source is a depolarizing channel Af p , 
hence of the form 



P 

N P {p) = (1 - p)p + 3 Kp o x 



ay poy + Z po z )- 



For these channels, a simple calculation reveals that their 
entanglement fidelity for an isotropic qubit source is equal to 
p, because the non-identity Pauli operators are traceless. Thus, 
for a given distortion D, the channel achieving the minimum 
mutual information is a depolarizing channel with p < D. The 
latter is given by 



1 



iif({l- Pj §,i,f}), 



thus finishing the proof. ■ 
In prior work [8|, Devetak and Berger showed that the 
classically-assisted quantum rate distortion function in The- 
orem 10 significantly simplifies for an isotropic qubit source. 
For convenience, we restate their result as the following 
theorem and provide a simple proof of it below. 

Theorem 12 (Devetak and Berger [8]): The classically- 
assisted quantum rate distortion function for an isotropic 
qubit source is equal to the following expression: 



R q _+(D) 



h. 



VD(1-D] 




0<D<± 
- 2 <D<1 



In the above, /i2(p) = — plogp — (1 — p) log(l — p) is the 
binary entropy for any probability p. 

Proof: First, recall the general expression for the 
classically-assisted quantum rate distortion function from The- 
orem 10 Devetak and Berger have shown that this expression 



assumes a single-letter form for the case of an isotropic qubit 
source |8 1, and we do not reproduce the proof of this statement 
here. 

So, the expression for the classically-assisted quantum rate 
distortion function in this case reduces to 



min min HtBlX)^, 

rf(7r,AA)<D{A x } 



(21) 



where the conditional entropy is with respect to the following 
state: 



urbx 



YjVr ® Ac)($rA')(Ir <8 A x ) « \x)(x\ x , 



and the operators {A^j^ are the Kraus operators for a channel 
Af meeting the distortion constraint. For simplicity, let us 
denote the optimal channel meeting the distortion constraint 



in (21 1 as Af and the optimal Kraus decomposition for the 
entanglement of formation as {A x } x , so that d(n,Af) < D. 
By the same argument as in the previous theorem, the channel 
with the set of Kraus operators {cj^s "*};*; for a fixed i 
causes the same distortion D to an isotropic qubit source while 
having the same value for the entanglement of formation. 
Also, by the same argument, the twirled channel with Kraus 
operators \^o\A x Oil \[Yl^ x ,i causes the same distortion D as 
the optimal channel, but this channel can have only a lower 
value of the entanglement of formation, due to the convexity 
of the entanglement of formation |9|. Now, the twirled channel 
is a depolarizing channel causing distortion < D to the source, 
implying that its effect on a maximally entangled state is to 
produce an isotropic state, i.e., a mixture of Bell states of the 
following form: 



RA 1 



3 *RA' 



3 ^RA> 



3^RA>> 



where p < D. In the above mixture, it must be the case that 
1 — D is larger than all of the other components whenever 
D < 1/2. In this case, it is well known that the entanglement 
of formation of such a Bell mixture is equal to the following 
expression |9|: 



■ 2 



Vd(i-d) 



For D > 1/2, the strategy requiring no quantum commu- 
nication is very simple, implying that there is no need to 
explicitly evaluate the expression in the theorem statement. 
For every copy of the source, Alice just measures it in 
the basis {|0), |1)} and sends the measurement outcome to 
Bob over the classical channel. Bob then prepares the state 
|0) or |1) depending on what he receives from Alice, and 
he forgets what Alice sent to him. This procedure prepares 
the dephased state i(|0)(0| fl <g> |0)(0| s + |1)<1|ji ® |1)<1|b) 
shared between Bob and the reference, which has distortion 
1/2 from the maximally entangled state. To achieve an even 
larger distortion with no quantum communication (if one so 
wishes), Bob could just depolarize his state locally. ■ 



The expressions from Theorems 1 1 and [T2] give two lower 
bounds on the unassisted quantum rate distortion function of 
an isotropic qubit source, plotted in Figure [2] 

We can also obtain an upper bound on the quantum rate 
distortion function for the isotropic qubit source by computing 
the unregularized entanglement of purification bound from 
Theorem 5 of Ref. [5|. In particular, a strategy to achieve the 
unregularized bound is as follows. The protocol begins with 
Alice and the reference sharing many copies of a maximally 
entangled state Qra 1 (the reduction of each of these to 
Alice's systems is a maximally mixed state). Given a distortion 
constraint D, Alice applies some isometric extension of the 
following depolarizing channel to each of her systems: 

■A/r>(p) = (1 - D)p+ f(ffxpffx +<r Y P<Jy + °z p<?z), 
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Fig. 2. Bounds on the unassisted quantum rate distortion function for an 
isotropic qubit source. We have lower bounds from the classically-assisted and 
entanglement-assisted (EA) quantum rate distortion function of this source. 
The convexified entanglement of purification (EoP) provides an upper bound 
on the quantum rate distortion function of this source. 



resulting in the following state shared between the reference 
and Alice: 



(1 - D)$ RA , 



RA' 



RA' 



RA' 



(22) 



Alice then Schumacher compresses the output of the depolar- 
izing channel and some share of the environment (which she 
possesses since she performs the isometric extension), and she 
can do this at a rate equal to the entanglement of purification 
of the above state. Furthermore, Alice and Bob can time-share 
between any two strategies of this form, implying that the 
rates and distortions of the time-shared protocol combine as 
in Remark[5] The authors of Ref. [6 1 have already numerically 
calculated the entanglement of purification of the state in ( [22] ) 
in Figure 1 of their paper. As such, the convex hull of their 
plot serves equally well as an upper bound on the quantum 
rate distortion function of the isotropic qubit source, and we 
have reproduced this plot in our Figure [5] above]^] 

VI. Quantum Rate Distortion in the Presence of 
Quantum Side Information 

In this section we study quantum rate distortion in the 
case in which Bob has some quantum side information (QSI) 
about the source, as an auxiliary resource. As mentioned in 
the Introduction, this corresponds to the following setting: 
Suppose a third party (say, Charlie) maps the source state p 
via some isometry to a bipartite state pab and distributes the 
systems A and B to Alice and Bob, respectively. The goal 

2 The only change we need to make to their plot to suit our purposes is to 
flip it with respect to the horizontal axis, eliminate values of the entanglement 
of purification beyond D = 3/4, and take the convex hull of the resulting 
curve. The maximally mixed state on Alice and the reference is a state that 
meets the distortion constraint at D = 3/4 so that at this distortion or beyond, 
there is no quantum communication needed — Alice simply discards her qubits 
and Bob prepares the maximally mixed state at his end. 



is for Alice to transfer her system A to Bob, up to some 
given distortion, using as few qubits as possible. The system 
B, which is in Bob's possession, acts as the quantum side 
information, and he can make use of it in his decompression 
task. It is required that the protocol causes only a negligible 
disturbance to the state of the reference system and Bob's 
quantum side information, in case Bob might want to use his 
system in some subsequent quantum information processing 
task. The above problem is a quantum generalization of the 
Wyner-Ziv [ 1 1 problem and is also a natural extension of the 
work of Luo and Devetak ifTTI which dealt with classical rate 
distortion theory in the presence of QSI (and thus considered 
Alice to receive a classical system instead of a quantum one). 

The rate distortion function, which we denote as R q qsi (D) 
for any given distortion D > 0, is then the minimum rate of 
quantum communication required for this task, evaluated in 
the limit in which Alice and Bob share asymptotically many 
copies of the state pab- It is defined analogously to R q (D) 
(see Section |IV}, except that the encoding and decoding maps 
are now given by 



and 



v{uT), 



where denotes the Hilbert space associated with Bob's 

QSI. 



Theorem 16 of Section VI-D gives an expression for 



R q qsi (D). Before going over to it, we briefly recall an impor- 
tant protocol of quantum information theory, namely, quantum 
state redistribution [17|, |18|. After doing so, we then employ 
it in Section VI-B| to develop a quantum reverse Shannon 
theorem in the presence of QSI — the main tool that we use to 



prove Theorem 16 



A. Quantum State Redistribution 

Quantum state redistribution is an important protocol in 
quantum information theory lfT7l . ifTSl and is defined as 
follows. Alice and Bob share many copies of a tripartite 
state pabc, where Alice holds the systems labeled by A and 
C, and Bob holds the systems labeled by B. Let the state 
Pabc be purified by a reference system R, the pure state 
being denoted as iPabcr- The task is for Alice to transfer 
the systems labeled by C to Bob, while keeping the overall 
purification i/jabcr approximately unchanged (possibly with 
the help of prior shared entanglement). The quantity of interest 
is the minimum rate of quantum communication from Alice 
to Bob needed to accomplish this task. The rate is evaluated 
in the limit of asymptotically many copies of the state pabc 
that is initially shared between Alice and Bob. 

Let Q and E denote the rates of quantum communica- 
tion and entanglement consumption^] respectively, required to 
achieve quantum state redistribution. Devetak and Yard ifTTIl 

3 A negative entanglement consumption rate implies that entanglement is 
instead generated! The reader should keep this in mind any time we refer to 
the "entanglement consumption rate" of a protocol. 
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proved that the corresponding resource inequality is given as 
follows: 

i>AC\B\R + -> l] + E [ll] > i>A\CB\R, (23) 

if and only if Q and E satisfy the following inequalities: 

Q>\I{R;C\B) 4 , 
Q + E>H(C\B).^. (24) 



The meaning of the resource inequality in ( |23| i is that, for 
n large enough, Q qubits of quantum communication and 
Ei ebits of entanglement are sufficient to transfer all n 
of the C systems from Alice to Bob, while generating an 
additional E2 ebits of entanglement, such that E = E\ — E%. 
Moreover, the fidelity of this protocol is equal to one in 
the asymptotic limit, and iPac\b\r an d iPa\cb\r denote the 
states before and after the protocol because Alice begins by 
holding the systems labeled by AC and ends by holding only 



the systems labeled by A. From ( 23 1-( 24 1, we infer that if 



\l{R; C\B)^ > H(C\B)$ then the protocol redistributes the 
system C to Bob as well as generates entanglement. In this 
case, the resource inequality takes the form: 

iJAC\B\R+ll{R\C\B)^[q^q] 

> i>A\CB\R + (\I{R; C\B)^ - H{C\B) 4 ) [qq], (25) 
which can be equivalently written as 

^AC\B\R+ \l{R\C\B)^[q ^ q] 

> Mcb\r + W(B; C% - I{A- C)^)[qq], (26) 

which follows from the definitions of the conditional mutual 
information and the conditional entropy, duality of conditional 
entropy (Lemma pi, and the fact that i^abcr is a P ure state. 

If, in contrast, 5 1(R; C\B)^ < H{C\B)^, then entangle- 
ment is consumed in order to achieve state redistribution, and 
the resource inequality can be written as 

^AC\B\R+ \l{R\C\B)^[q ^ q] 

+ i(I(A; C) 4 , ~ I(B; C)^)[qq] > ^ a \ C b\r • (27) 

From |26) and ( |2"7j ) it follows that if Alice and Bob have no 
prior shared entanglement, then they could still achieve their 
task of state redistribution: if \l{R; C\B)$ > H{C\B)^ then 
they also generate entanglement, whereas if \l(R; C\B)^, < 
H(C\B)^ then they need to invest quantum communication 
at a rate of s» H(C\B),^ qubits per copy of the source in order 
to generate the entanglement that the protocol corresponding 



to (27 1 requires. 



B. Quantum Reverse Shannon Theorem with Quantum Side 
Information 

As mentioned above, before moving on to quantum rate 
distortion theorems with QSI, we prove two quantum reverse 
Shannon theorems with quantum side information. 

Quantum reverse Shannon theorems lfl3l . 1141 deal with 
the simulation of noisy quantum channels between two parties 
(Alice and Bob, say), with the aid of noiseless resources, such 
as prior shared entanglement and quantum communication. 



Here we consider the situation in which Bob has quantum side 
information as an auxiliary resource, which he can employ in 
this simulation task. 

In particular, we consider Alice and Bob to share many 
(say, n) copies of a bipartite state pab, the systems A being 
with Alice and B being with Bob, the latter acting as the 
quantum side information. In addition, Alice and Bob can 
share entanglement with each other. The aim is for Alice and 
Bob to simulate many instances of a noisy channel JVa^B', 
such that Bob receives the output systems. The quantities of 
interest are the minimum rates of quantum communication and 
entanglement consumption required for this purpose. 

We consider two different scenarios. In the first, which is 
referred to as a feedback simulation, the environments of the 
simulated channels are required to be in Alice's possession. 
In contrast, in the second scenario, which is referred to as a 
non-feedback simulation, no such requirement is imposed. The 
minimum rates of quantum communication and entanglement 
consumption that are required in these two scenarios are given 



by Theorems 13 and 14 respectively. 

These theorems are generalizations of Theorems 3a, 3b, 
and 3c of Ref. T\3l and are interesting in their own right. 
Furthermore, both theorems are useful in establishing quantum 
rate distortion theorems that exploit quantum side information. 

Theorem 13 (Feedback QRST with QSI): If Alice and Bob 
share many copies of a state pab, tnen they can achieve a 
feedback simulation of many instances of a noisy channel 
Na-^b 1 if and only if the rates of quantum communication and 
entanglement consumption are in the following rate region: 



Q + E>H{B'\B) U , 
Q>\I(R;B'\B) U 



(28) 
(29) 



where 



t>RAB)i 



URB'B = Na-^-B' 

(f> R AB i s a purification of pab- Equivalently, we can write 
the rate of quantum communication as a function of the 
entanglement consumption rate E as follows: 

Q f , qsi (E) =max{ll(R;B'\B) ul , H(B'\B) U -E}. 

(Recall that E can be either positive or negative depending 
on whether the protocol consumes or generates entanglement, 
respectively.) The subscript / denotes that the rate corresponds 
to a feedback simulation. In particular, if there is no entan- 
glement available (E = 0), then the optimal rate of quantum 
communication is equal to 

Q f , qsi {0) = max{i/(i? ;J B'|S) w , H(B'\B) U }. 

Theorem 14 (Non-Feedback QRST with QSI): If Alice and 
Bob share many copies of a state pab, then the minimum rates 
of quantum communication and entanglement consumption 
that they need for a non-feedback simulation of many instances 
of a noisy channel JVa^B' are given by the union of the 
following rate regions: 



Q + E > ^H(B' k E B \B k ) u , 
Q>^l{R k \B' k E B \B k ) 



(30) 
(31) 
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where k is an arbitrary positive integer and the union is with 
respect to all states u> of the following form: 



Reference 



u >R k E A E B B' k B k = V E k^ E E 



(32) 

0RAB i s a purification of pab, U^-^b'E i s some isometric 
extension of the channel Ma^b'i an d VE k ^E A E B i s an 
arbitrary isometry that splits the k environment systems E k 
into two parts Ea and E E - Equivalently, we can write the rate 
of quantum communication as a function of the entanglement 
consumption rate E as follows: 

Q gsi (E) = liminf max\^-l(R k ;B' k E B \B k ) LJ , 

H fc->oo,3V {2k y ' 



1 



K H(B' K E B \B K )„-Ey (33) 

In particular, if there is no entanglement available (E — 0), 
then the optimal rate of quantum communication is equal to 

) qsi (0) = liminf max{^-l(R k ;B' k E B \B k ) u , 



^H(B ,k E B \B k )^. (34) 



Proof of Theorems 13 and 14 We prove the two theorems 
using similar arguments. The achievability parts of both of 
the above theorems follow directly by applying the protocol 
of Devetak and Yard for quantum state redistribution |17|, 
llT8l . OBI . We prove the achievability part of the non-feedback 
theorem first and then argue how the feedback version is a 
special case of it. 

To start with, Alice, Bob, and the reference share n copies 
of the state 4^rab' which is a purification of the state pab- 
Alice locally applies an isometric extension U A ^ B , E of the 
channel Na^b 1 to each system A in her possession, and then 
applies an environment-splitting isometry V e ^ EaEb to each 
system E that results. At this point, the three parties share n 
copies of the following pure state: 



Ure a E b B'B 



= V, 



E^E A E E 



{Ua->-b' e(.4>rab)) > 



where the reference has R, Alice has EaE b B', and Bob has 
B. Alice would like to transmit all of her E B B' systems to 
Bob, and she can do this by using the protocol of quantum 
state redistribution. 

By setting C = B'E B , A = Ea, and 4>racb = 



ure a e b b' b in (23 H 27 1, we infer that the following rate 
region is achievable with quantum state redistribution: 

Q>\I{R;B'Eb\B) u , 
Q + E> H(B'E B \B) U . 

Now, a protocol that achieves the regularized rate region in 
( 30 1-( 31 1, is very similar, but Alice and Bob instead act on 



blocks of k states at a time. That is, they share n copies of 
the state {4>RAB)® k (^ m ey are allowed access to an arbitrary 
number of shares of this state, then they can block them in this 
way). Alice applies n instances of the isometry (U A ^_^ B , E )^' k 
to her systems A n and then applies an environment splitting 
isometry V E u^ E E to each E k resulting from the previous 




Fig. 3. A general protocol for simulating a quantum channel when quantum 
side information and shared entanglement are available. 



step. By the same arguments as given above, the rate region 



in ( 30 1-( 3 1 1 is achievable, where the division by k is needed 
to obtain the rates. 

The achievability of the feedback protocol in Theorem 13 
follows as a special case of the above. In particular, there is 
no splitting of the environment into two parts, so that Alice 
merely redistributes the B' system to Bob. Thus, the rate 
region in (|28]>-([29| is achievable. Furthermore, there is no need 
to double-block the protocol as above because our converse 
theorem for this case demonstrates that it is not necessary to 
do so. 



We now prove the converse part of Theorem 14 that is 



we establish that the bounds ( 30 >-( 31 1 hold for any protocol 



that results in a non-feedback simulation of the noisy channel 
Na-^B'- The most general protocol begins with the reference, 
Alice, and Bob sharing n copies of 4> p R ab> wnere n lS some 
arbitrarily large positive integer. Alice and Bob also share 
some entangled state &t a t b on systems Ta and T E , which 
they can use to help them in their task. In particular, the 
Schmidt rank of this entangled state is equal to 2 nE , so 
that E quantifies the rate of entanglement consumption. Alice 
performs some encoding map £ on systems A n and T4 which 
produces a system W as output. She sends system W to Bob, 
who then performs a decoding map T> on systems W, T E , 
and B n . This decoding map has two outputs B' n and B n , 
where B' n approximates the output of the channel simulation 
and B" represents an approximation of Bob's quantum side 
information. If the protocol is any good for accomplishing the 
task of a non-feedback channel simulation, then the output of 
this simulation protocol and the output of the ideal protocol 
should be asymptotically indistinguishable as n becomes large. 
That is, for any arbitrary e > 0, for n large enough, it should 
hold that 



T> WT B B'^B'"B"( S ^T A ^w(((l) R AB)' S ' n 



®T A T B )) 

< e. (35) 



Figure [3] illustrates the protocol discussed above. 

A useful observation for proving the converse is that an 
arbitrary encoding map £ and a decoding map T) can be 
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simulated by a protocol involving only isometric operations. 
In particular, we can replace the encoding £ with an isometric 
extension U £ that has as output the original output W and 
an environment system E\. Let a = URE x WT B B n denote the 
pure state shared between the reference, Alice, and Bob after 
the action of U . We can also replace the decoding map T> 
with an isometric extension of it that has as output the original 
outputs B' n B n and an additional environment system E 2 . Let 
UJ RE 1 E 2 B' n B n denote the pure state resulting from applying 
an isometric extension of the decoder to the state a. 

Due to monotonicity of the trace distance under partial trace, 



the condition in p5\ implies that the inequality in ( 36 1 holds, 



with the partial trace of the states in ( 35 1 being taken over the 
system B' n . 

Since ^ J RE 1 E 2 B' n B 71 * s a purification of the first term in 
the trace distance in (36i, and <JRE x WT B B n is a purifica- 
tion of the second term in the trace distance, Uhlmann's 
theorem [20|, 11361 implies the existence of an isometry 
V = Ve 1 wt b ->e 1 e 2 b'™ such that the trace distance between 



V(a RElW T B B") and ^ REiB2 b'«b^ is no lar S er than 2 V^- Let 
u/ denote the state resulting from applying V to c re 1 wt b B n • 
Thus, the original decoder can be simulated by the isometry 
V which does not act on Bob's quantum side information (we 
should expect for this to be possible, given that the condition 



in (35 1 implies that Bob's QSI should not be disturbed too 
much). We also observe that uj' is e-close in trace distance 



to a state of the form in (32i because U £ and V act only 



on Alice's systems. By applying Uhlmann's theorem once 
again and the triangle inequality, we conclude that there 
exists some isometry U = Ue™-+e 1 e 2 such that when W 
is applied to <J , the resulting state is close in trace distance to 
(U A ^ B , E (4> RAB ))® n . Figure |4] summarizes the observations 
made in this paragraph and the previous one. 
Consider the following state: 



A-^B'E 



Wrab)) 



(37) 



where (j) RAB is a purification of pab, U a _^ b , e is a Stinespring 
isometry of the noisy channel JV which is to be simulated, and 
UE n ^E ± E 2 is an isometry. We proceed to find a lower bound 
on the optimal rate Q of quantum communication needed for 
the channel simulation as follows: 

nQ = log(dim-H^) > H(W) a 

= H(WT B B n ) a - H(B n T B \W)<y 
= H(E 2 B' n B n ) UJ - H(B n T B \W) a 

> H(E 2 B' n B n ) UJ , ~ ne' - H{B n ) a 

- H(T B ) a 

= H(E 2 B' n B n ) UJ > - ne 1 - H{B n ) u , 

- H(T B ) a 

> H(E 2 B' n \B n ) UJ ' -nE- ne 1 
>H{E 2 B' n \B n ) T -nE-2ne'. (38) 

The first equality is an entropy identity. The second equality 
follows because entropy is invariant under the action of an 
isometry (in this case, the isometry is U ). The second 
inequality follows from Uhlmann's theorem (as mentioned 



above), the Alicki-Fannes' inequality (continuity of entropy) 
with an appropriate choice of e' (a similar convention to 
what we had in our previous converse theorems), and from 
subadditivity of entropy: 

H{B n T B \W)„ < H(B n T B ) a < H(B n ) a + H(T B ) a . 

The third equality follows because the map 

yE 1 WT B ^E 1 E 2 B'K doefJ nQt act Qn the system . 

The third inequality follows because H(T B ) a < nE. The 
final inequality follows from another application of the 
Alicki-Fannes' inequality to the state t defined in ([37}. 
We prove the second bound in plj: 

I{R n ; B' n B n E 2 ) T < I{R n ;B' n B n E 2 ) UJ + n2e' 

< I(R n ;B n WT B ) a +n2e' 

< n2Q + I(R n ; B n T B ) a + n2e' 
= n2Q + I(R n ;B n ) a + n2e / 

= n2Q + I(R n ;B n ) T + n2e' (39) 

The first inequality results from the fact that r is e-close to 
u/, lu' is e-close to uj, and from applying the Alicki-Fannes 
inequality. The second inequality follows from quantum data 
processing. The third inequality follows from the following 
property of quantum conditional mutual information: 

I(A; BC) = I(A; B) + I(A; C\B) 

= I(A; B) + H(A\B) - H(A\BC) 
< I{A;B) + H(A) + H(A\D) 
<I(A;B) + 2log\A\, 

where the entropies are evaluated on a tripartite state pabc 
purified by some state on a purifying system D. The first 
equality in the chain in ( f3"9| ) follows because T B is product 
with systems R n and B n for the state a. The above inequalities 
then imply that 

n2Q > I(R n ; B ln E 2 \B n ) T - n2e', 

by finally applying the chain rule for quantum mutual infor- 
mation. 



To obtain a converse proof for Theorem 13 we can exploit 
the above converse with just one further observation. Since a 
feedback simulation requires Alice to possess the full environ- 
ment of the simulation, the final state on R n E\B' n B n must 
be a pure state and E\ must be unitarily related to the E n 
system of (U^_^ B , E ((j) RAB ))^' n . Thus, it must be the case 
that the system E 2 is product with R n E x B' n B n . This final 
observation leads to a single-letterization of the above bounds 
as follows: 

nQ > H(E 2 B' n \B n ) T - nE - 2ne' 
= H{B ln \B n ) T -nE-2ne' 
= n[H{B'\B) MA ^ Bl{pAB) -E-2e% 

and similarly, 

n2Q > I(E 2 B' n ; R n \B n ) T - 2ne' 
= I(B' n -R n \B n ) T -2ne' 



I(B';R\B) 



2e' 
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TlE 1 E 2 B<™ | 



v 



u 



£ 



R.AB 



, <8>ri 



$™))} 



i <e, 



(36) 
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Fig. 4. The protocol on the left is a simulation of the general protocol in Figure[3] in which Alice acts with an isometric extension of her encoder £ and 
Bob acts with an isometric extension U' D of his decoder T>. Due to the fact that the simulation causes only a negligible disturbance to the state of R n B n , by 
Uhlmann's theorem, there exists an isometry VE 1 wT B ^E 1 E 2 B'n such that the final state on the left is approximately equal to the final state on the right. 



The main step in the above equalities is to exploit the obser- 
vation that E2 must be product with the other systems for a 
feedback simulation. ■ 

C. On a General Quantum Reverse Shannon Theorem with 
QSI 

With the above tensor-power quantum reverse Shannon 
theorem with QSI in hand, one might be tempted to pursue a 
general form of this theorem that holds whenever the input to 
the channel is a general, non-IID state entangled with a system 
available at the receiver's end (non-IID input and quantum 
side information). From the above theorem and the techniques 
developed in Refs. lfT3l . Ifl4"ll . we suspect that it is possible to 
show that the following rate of quantum communication is 
necessary and sufficient for simulating an IID channel acting 
on a general input entangled with a system at the receiving 
end, whenever unlimited entanglement in any form is available 
between the sender and receiver: 



i max I(R; B'\B) 



■V, 



(40) 



Also, it is known from Refs. lfl"3ll . [14| that the following 
rate of quantum communication is necessary and sufficient 
for simulating an IID channel acting on a general input (ne- 
glecting any quantum side information), whenever unlimited 
entanglement in any form is available between the sender and 
receiver: 



\ max/(i?; B' 



<«-)• 



The following theorem clarifies that these two rates are in fact 
equal, implying that pursuing a general quantum reverse Shan- 
non theorem with QSI is a pointless task if the conjectured 
rate in ( |40| > is correct (at least for a feedback simulation). The 
reason that such a relation should hold is that the theorems 
from Refs. Ifl3l . Ifl4l are simulating an IID channel with 
respect to the diamond norm|^] which is known to be robust 
under tensoring with other systems on which the channel does 
not act. 

4 See, e.g., Refs. 1131 . 1141 for a definition of the diamond norm. 



Theorem 15: The following identity holds 

Tnax.I{R-,B')x w = max I (R; B'\B) M ■ 

4>RA ' VRAB 

where each maximization is over pure states. 

Proof: We first prove the following inequality: 

max.I{R;B')x m < max I{R;B'\B) M 

4>RA ' VRAB 

Let ijj* RA be the state that achieves the maximum of the LHS. 
Then the state ^ra ® fB (f° r an y P ure state ipg) leads to 
AfA-tB'iipRA) ® f B at me channel output, and it is a special 
pure state included in the maximization on the RHS, so that 

I{R; b ')m a ^ b , («/>*) = I{ R \ B '\ B )u A ^ B ,{^-®v) 
< mvxI{R;B'\B) MA ^ B , {4>) . 

<PRAB 

We now prove the other inequality: 

waxI(R;B') Af w > max I(R; B'\B) N {<t>) . (41) 

Vra <prab 

Let <P*rab t> e the pure state that achieves the maximum on the 
RHS. Then we have that 

I(R; B'\B)M A ^ B ,(<f,*) = I{RB\B')j^ A ^ Bl{r) 
- I{B]B') MA ^ B , {r) 

< I(RB;B') f j A ^ B ,( < p** ) 

< maxI(R;B')tf w . 

WRA 

The first equality follows from the chain rule for quantum 
mutual information, and the first inequality follows because 
I(B; B') > 0. The final inequality follows because <fi* is some 
pure bipartite state with respect to the cut RB\B', where the 
systems R and B purify the input to the channel, and this is 
of course a particular kind of state in the maximization on the 
LHS of (glj. ■ 

D. Quantum Rate Distortion with Quantum Side Information 

Having established the quantum reverse Shannon theorem 
with QSI, we now prove a theorem characterizing the quantum 
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rate distortion function in the presence of QSI, which is 
denoted as R q qsi (D) and was introduced at the beginning of 
Section IVJ 

Theorem 16: Consider a bipartite state pab, obtained by 
the action of an isometry on the source state of a memoryless 
quantum information source. Suppose Alice has the system A 
and Bob has the system B, the latter acting as QSI. Let 4> P RAB 
be a purification of pab- Then for any given distortion D > 
0, the quantum rate distortion function with QSI, evaluated 
under the condition that the protocol causes only a negligible 
disturbance to the joint state of the BR systems, is given by 

Rl si (D)=liml min lM PAB f k ,M^) 

H fc-i-oo k tf{k) , d(p,Af( k ))<D v ' 

(42) 

where 



I P (pAB, Na^b>) 



V E - 



inf 



max <, ^I(R;B'E B \B) U , 



H(B'E B \B) 



with ujre a e b B'B — Ve^e a e b 
above, M a ^b> and A/" (fe) : V(H A * 



Ua-^b 1 e^rab 



)). In the 
V(U B h) are CPTP 

maps, U A _^ B , E * s an isometric extension of Na^B', and 
Ve^e a Eb i s an environment-splitting isometry. 

Proof: The achievability part of this theorem follows 



easily from Theorem 14 In particular, we just fix the map 



that achieves the minimum in (42 1, and it easily follows that 



performing the protocol from Theorem 14 leads to a state on 
R n and B' n that has distortion no larger than D with the 
original state on R n and A n . We can block the protocol to 
achieve the regularized formula. 

We now prove the converse. As before, the most general 
protocol begins with the reference, Alice, and Bob sharing 
n copies of the state <Prab- Alice performs some encoding 
map £ on the systems A n , obtaining a quantum system W . 
She sends system W to Bob using noiseless qubit channels. 
Bob feeds this system, and his quantum side information B n , 
into a decoding map T>, which produces as output systems 
B' n and B n . We demand that the distortion of the state of 
systems R n B' n with respect to the state of R n A n at the start 
of the protocol be no larger than D. Also, we demand that the 
decoder causes only an asymptotically negligible disturbance 
of the joint state of the reference and the quantum side 
information, in the sense that for any e > 0, for n large 
enough: 



Tr_B'n I D 



IV B" 



>B m B n 



< e. (43) 



Like our other converses, the key to this proof is the 
realization that the above general protocol can be simulated by 
one in which we exploit an isometric extension of the encoder 
U £ , which maps A n to W and an environment E\. Let a 
denote the overall state after U £ acts (so that ctrk-EiWB™ is a 
pure state). We also exploit an isometric extension U v of the 
decoder, which maps WB n to B' n B n and an environment E 2 . 



Let oj = <^Rn ElE . )B , nB n denote the overall state after U v acts. 
We proceed with bounding the rate Q of noiseless quantum 



communication by following the same steps as in (38 1 and 



(39 1, but ignoring the entanglement assistance: 

nQ > H(E 2 B' n \B n ) hJ > — ne', 



> H(E 2 B \B n ) L 
nQ > \l{E 2 B' n ;R n \B n ) u 



ne 



(44) 
(45) 



Putting things together, we obtain the following lower bound 
on the quantum communication rate: 



nQ > max -I{E 2 B' n - R n \B n ) u , , H(E 2 B' n \B r ' 



ne 



> inf 



max<j ^I(E 2 B ,n ;R n \B n ) 



ne 



> 



H{E 2 B' n \B n \ 

l P {{PA B T n ,Af {n) 



ne ' 



■ d(p®",A/"<™))<D 

where M {n) : V(Ha*) -> V{U B ") denotes a CPTP map. 
The first inequality follows by combining ( [44] ) and ( |43| ). The 
second inequality follows by performing an optimization over 
all possible splits of the environment. The final inequality 
results from minimizing I p over all maps J\f( n > that act only 
on A n and meet the distortion criterion. ■ 

E. Entanglement-Assisted Quantum Rate Distortion with 
Quantum Side Information 

If in addition to Bob having quantum side information, Alice 
and Bob have prior shared entanglement (over systems Ta 
and T B ) as an auxiliary resource, then the corresponding rate 
distortion function, for any given distortion D is denoted as 
R q eaqs i{D). It is defined analogously as in (10 Mill, except 
that Bob can use the QSI in his decompression task. Hence 



the decompression map is given by 



A J 



We prove the following theorem: 

Theorem 17: Suppose Alice and Bob share entanglement 
and a state pab (obtained from a memoryless quantum in- 
formation source), such that the system A is with Alice and 
B is with Bob. Let ^rab ^ e a purification of pab- Then 
the quantum rate distortion function, R q ea qs i{D), evaluated 
under the condition that the protocol causes only a negligible 
disturbance to the joint state of BR is given by 



1 



min I(R:B'\B) C 

M : d(p,Af)<D 



(46) 



where the state a is defined as: 

ORB'B = Na-^b> 



Vrab)- 



Proof: The achievability part follows easily from the 
protocol for quantum state redistribution. Fix JV to be the 
CPTP map which achieves the minimum in ( f4"6| ) for a given 
distortion D. It is straightforward to compute this minimum 
because the information quantity I(B'; R\B) a is convex in the 
map Af. This follows easily from the identity I(B'; R\B) a = 
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I(B'B; R) a - I(B; R)^, and the fact that the map, TV, acts 
only on the system A. The reference, Alice, and Bob share 
n copies of 4> P RAB - First Alice acts on her system A n with 
many instances of an isometric extension U^_^ B , E of the 
map Na^b 1 ■ Let (4>RB'EB)® n denote the resulting pure state, 
with the systems B' n E n being in Alice's possession. Note 
that since Af is chosen to be the CPTP map which meets the 
distortion constraint d(p,Af) < D, the rate-distortion task is 
completed if the systems B' n are transmitted to Bob faithfully 
in the asymptotic limit (n — > oo). This is accomplished by 



using the protocol of quantum state redistribution. From ( 26 1 
it follows that the relevant resource inequality is given by: 



J R\B'E\B/ 



\I{R- B'\B)^[q 

> 



q] + \l{E;B%«[qq] 
\l(B;B') r [qq]. (47) 



J R\E\B'BI 

Clearly, with unlimited entanglement, the protocol accom- 
plishes the state redistribution task with a rate of quantum 
communication given by (|46l. 

We now prove the converse part of the above theorem. 
This proof bears some similarities with our converse proof 
of Theorem 3 of Ref. |5| and with the converse proof of 
Theorem 6 of |12|. The most general protocol begins with the 
reference, Alice, and Bob sharing the state (0^43)®", and 
Alice and Bob sharing entanglement in the systems and 
Tb, respectively (such that the dimensions of Ht a and Wt b 
are no larger than 2 nE ). Alice then acts with some encoding 
map £ on A n and Tg, producing a system W . Let a denote the 
state shared by Alice, Bob and the reference after the encoding. 
She sends W to Bob, who then acts with a decoding map T> 
on W, his share Tg of the entanglement, and his quantum 
side information B n to produce systems B' n and B n . Let oj 
denote the final shared state. Without loss of generality, we 
can simulate the above protocol by considering an isometric 
extension of the encoder that produces systems W and E as 
output. We demand that the protocol causes only an asymptot- 
ically negligible disturbance to the state on the reference and 
Bob's systems, in the sense that, for any e > and n large 



enough, the inequality in ( 48 1 should hold. Then the rate of 
quantum communication Q = (l/n) log(dim'Hw'), needed for 
the rate-distortion coding task, satisfies the following bound: 



2nQ>nRl a>qsi (D)-3ne', 



(49) 



for any e' > and n large enough. It is proved by employing 
standard entropic identities and inequalities, e.g. the quantum- 
data processing inequality, the Alicki-Fannes inequality |19|, 
and the superadditivity of the quantum mutual information 
(Lemma [3}. It also relies on the fact that the state R n B n is 
e-close in trace distance to a tensor-product state. For the sake 
of completemess, we have included the proof in Appendix |A| 



VII. Conclusion 

We have extended quantum rate distortion theory by consid- 
ering auxiliary resources that might be available to the sender 
and receiver. The first setting we considered is quantum rate 
distortion coding with the help of a classical side channel. 
Our result is that the regularized entanglement of formation 



characterizes the quantum rate distortion function, extending 
earlier work of Devetak and Berger [8 1. We also combined this 
bound with our entanglement-assisted bound from Ref. (5) to 
obtain the best known bounds on the quantum rate distortion 
function for an isotropic qubit source. The second setting we 
considered is quantum rate distortion coding with quantum 
side information available to the receiver. Before proving 
results in this setting, we proved a quantum reverse Shannon 
theorem with quantum side information (for tensor-power 
input states), which naturally extends the quantum reverse 
Shannon theorem (for tensor-power inputs) in Ref. [13|. The 
achievability part of this theorem relies on the quantum state 
redistribution protocol ifTTl . JTS), while the converse relies 
on the fact that the protocol can cause only a negligible 
disturbance to the state of the reference and Bob's quantum 
side information. This result naturally leads to quantum rate- 
distortion theorems with quantum side information, with or 
without entanglement assistance. 

All of our proofs rely on one particular approach to quantum 
rate distortion theory: exploiting a quantum reverse Shannon 
theorem for the task of quantum rate distortion coding. It 
would be a breakthrough for this theory if one could develop a 
different approach that leads to better characterizations of lossy 
quantum data compression tasks, beyond the ones presented 
here. 
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Appendix 

First recall that the quantum rate distortion function has the 
following characterization: 



R q (D) = lim y 

k— >oo k 



min E, 

AA fc > : d(p,Af( k ))<D 



We would like to show that the expression on the RHS above 
is convex in D: 



R q {D x ) < XR^D,) + (1 - \)R q (D 2 ), 



(50) 
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(48) 



where £>a = ADi + (1 — X)D 2 . We will choose fci and k 2 to 
be large integers such that 



X. 



(We can actually just take ki = [afe] f° r a = [t — 1] S so 
that there is just one integer to consider.) For ki and where 
i G {1, 2}, let M^ ki) be the CPTP map that minimizes 



nun Eplp""*,^** 

Art**) : d(pM ki h<D t ' 



Thus, whenever the distortion measure under consideration is 
linear and averaged, we have that the distortion caused by the 
map J\f[ kl ^ ® N 2 k ^ is approximately equal to D\. Then we 
have that 



1 



k\ + k 2 



jmin E; 

AA fc i + fc 2) : d(p,Af< k i + k 2))<D 



< 



1 



r(fe) 



< 



fci + k 2 
1 



fci + fc 2 
fci / 1 

fci + fc2 V^l 

„ JW! 

fci + k 2 \ k 2 



E p ( P ®^ +k *\M[ kl) 



+k 2 



E, 



+ (1-A) lE^X^). 



The important second inequality follows from subadditivity of 
the entanglement of purification. Since the above relation holds 
for every choice of k 2 and k\ = \ak 2 ~\ and the corresponding 
minimizing maps A/f and A/f a) , it follows that it holds in 
the limit, implying ( |5Q| as desired. Clearly, this argument is 
very similar to the observation regarding time-sharing of rate 
distortion codes in Remark [5] 

Here we give the details of the proof of the converse part of 



Theorem 17 which deals with entanglement-assisted quantum 
rate distortion with quantum side information. Continuing 



from (48 i, we obtain a lower bound on the optimal rate Q 
of quantum communication needed for this 

2nQ = 21og(dim"H M /) 

> 2H{W) a 

= H(W) a + H{R n T B B n E) a 

= H(W) a + H(R n T B B n E) a - H{WR n T B B n E) a 
= I{W;R n T B B n E) a 

> I(W; R n T B B n ) a 

= I(WT B B n -R n ) a +I(W-T B B n ) a - I(R n :T B B n ) a 



= I(WT B B n ; R n )„ + I(W; T B B n ) a - I(R n ; B n ) a 

> I(WT B B n ; R n ) a - I(R n ; B n ) a 

> I(B' n B n ;R n ) u - I(R n ; B n ) u - ne' . 



In the above, the states a and u> are as defined above ( |48| l. 
The first inequality follows because the entropy of a system 
is never larger than the logarithm of its dimension. The 
first equality follows because the entropy of the marginals 
of a pure bipartite state are equal. The second equality fol- 
lows because the entropy of a pure state is equal to zero, 
so that H(WR n T B B n E) = 0. The third equality is an 
identity. The second inequality follows from quantum data 
processing. The fourth equality follows from an identity for 
the quantum mutual information. The fifth equality follows 
because R n B n and T B are in a product state, so that 
I(R n ;T B B n ) a = I{R n -B n ) a . The third inequality follows 
because I(W;T B B n ) a > 0. The last inequality follows 
from quantum data processing and the assumption that that 
protocol causes only a negligible disturbance to the state of 
the reference and the receiver (the term e' arises from an 
application of the Alicki-Fannes' inequality, where e' is a 
function of e such that lim E _>.o e'(e) = 0). 

Let B' k , k = 1, 2, . . . , n denote the subsystems (with Hilbert 
space T~L B i) constituting the system B' n , Similarly, let Bk and 
Rk (k = 1, 2, ... , n) denote the corresponding subsystems of 
B n and R n respectively. Then 



RHS of (|48j> > ^ [l{B' k B k ;Rk)u - I{R k ;B k \ 

k 

= Y,I(B' k ;Rk\B k ) u -2ns'. 



- 2ns' 



The first inequality follows from superadditivity of quantum 
mutual information (Lemma [3]) and from the fact that the 
state on R n B n is e-close in trace distance to a tensor-product 
state (see Lemma 10 of Ref. [21]). The second equality 
follows from the identity I(B' k B k ; R k )u - I{Rk\B k )u> = 
I{B' k ;Rk\B k )ui- 

At this point, we proceed with an argument similar to 
that in the converse proof of Theorem 6 of lfl2l . Since the 
state on R n B n is not disturbed too much, by Uhlmann's 
theorem we know that there exists a CPTP map acting Alice's 
system alone, such that the conditional mutual information 
of the state io' resulting from this map is a lower bound on 
the conditional mutual information I(B' k ; R^Bk)^ for every 
ke {1,2,... ,n}: 

>Y,I{B' k -R k \B k )u, ~Zns' . 



Continuing, we have the same set of inequalities as in the last 
part of the proof of Theorem 6 of [ 12 1 (with R ea .qsi replacing 
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> nR 9 ea<qsi (D) ~ 3ne'. 
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